The AI Ladder by Rob Thomas
Author:Rob Thomas
Language: eng
Format: epub
Publisher: O'Reilly Media
Published: 2020-04-29T00:00:00+00:00
The Challenges of Collecting New Sources of High-Volume Unstructured Data
Managing data that arrives in real time requires new kinds of tools, such as Apache Kafka, which facilitates streaming data into databases. And unstructured data frequently requires new kinds of databases. As we saw in the previous chapter, RDBMSs can only store data effectively if that data’s structure is specified in a schema. If your incoming data is unstructured (or its shape is unknown), there’s no schema to work with. This is where NoSQL databases come in. The term NoSQL is starting to become out of favor, but it’s important to realize that the technologies referenced by this term are not; they provide other ways to manage data—document databases, graph databases, column stores, key value stores—and those tools will more than likely become part of your information agenda. Many of them are optimized for high-speed write access, which is exactly what you need for real-time data.
Getting access to all kinds of data means being “polyglot,” not in terms of programming language, but in terms of data storage and models. One of the toughest tasks on the first rung of the AI Ladder is designing uniform ways for everyone across the company to access the data. SQL has become a common language across all kinds of databases, including many NoSQL databases. Build web services that allow online access to databases from any point in the organization, using technology that’s already available at everyone’s desk (or, for that matter, a lunchroom, conference center, or airliner). And cloud providers can create robust data collections that can be accessed worldwide and are resistant (though not immune) to outages.
The technical aspects of data access are solvable. What about the organizational aspects?
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Computer Vision & Pattern Recognition | Expert Systems |
Intelligence & Semantics | Machine Theory |
Natural Language Processing | Neural Networks |
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(8064)
Hadoop in Practice by Alex Holmes(5793)
Jquery UI in Action : Master the concepts Of Jquery UI: A Step By Step Approach by ANMOL GOYAL(5640)
Test-Driven Development with Java by Alan Mellor(5044)
Life 3.0: Being Human in the Age of Artificial Intelligence by Tegmark Max(4886)
Data Augmentation with Python by Duc Haba(4870)
Principles of Data Fabric by Sonia Mezzetta(4687)
Learn Blender Simulations the Right Way by Stephen Pearson(4469)
Microservices with Spring Boot 3 and Spring Cloud by Magnus Larsson(4457)
Big Data Analysis with Python by Ivan Marin(4414)
Functional Programming in JavaScript by Mantyla Dan(3876)
RPA Solution Architect's Handbook by Sachin Sahgal(3850)
The Age of Surveillance Capitalism by Shoshana Zuboff(3660)
The Infinite Retina by Robert Scoble Irena Cronin(3577)
Pretrain Vision and Large Language Models in Python by Emily Webber(3391)
Infrastructure as Code for Beginners by Russ McKendrick(3197)
Deep Learning with PyTorch Lightning by Kunal Sawarkar(3145)
Blockchain Basics by Daniel Drescher(3079)
The Rosie Effect by Graeme Simsion(2923)